CDS

Accession Number TCMCG061C77848
gbkey CDS
Protein Id XP_042044156.1
Location complement(join(6770..6910,7148..7291,7375..7564,7943..8034,8143..8259,8491..8620,8861..8967,9108..9184,9420..9501,9700..9837,9948..10130))
Gene LOC121789880
GeneID 121789880
Organism Salvia splendens
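
The Location field above uses the feature-table syntax `complement(join(start..end,...))`, so the CDS is assembled from 11 spans on the reverse strand. As a rough consistency check, the sketch below sums the span lengths and compares the result with the 466 aa protein length given in this record. The function names `parse_location` and `cds_length` are illustrative, not part of any library, and the parser handles only this simple form of location string.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(6770..6910,7148..7291,7375..7564,7943..8034,"
    "8143..8259,8491..8620,8861..8967,9108..9184,9420..9501,"
    "9700..9837,9948..10130))"
)

def parse_location(location):
    """Return (strand, spans) for a simple join/complement location.

    Hypothetical helper: does not handle partial markers (< >) or
    remote accessions.
    """
    strand = "-" if location.startswith("complement(") else "+"
    spans = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return strand, spans

def cds_length(spans):
    """Total length in nucleotides; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in spans)

strand, spans = parse_location(LOCATION)
total = cds_length(spans)
# Codons minus one stop codon gives the expected protein length.
print(strand, len(spans), total, total // 3 - 1)
```

The spans sum to 1401 nt, which is 467 codons: 466 amino acids plus one stop codon, in agreement with the Length field.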

Protein

Length 466 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA737421
db_source XM_042188222.1
Definition heparan-alpha-glucosaminide N-acetyltransferase-like isoform X3 [Salvia splendens]

EGGNOG-MAPPER Annotation

COG_category S
Description Protein of unknown function (DUF1624)
KEGG_TC -
KEGG_Module M00078
KEGG_Reaction R07815
KEGG_rclass -
BRITE ko00000, ko00001, ko00002, ko01000
KEGG_ko ko:K10532
EC 2.3.1.78
KEGG_Pathway ko00531, ko01100, ko04142, map00531, map01100, map04142
GOs -

Sequence

CDS:  
ATGGCTTCAATCACGGCGGTATCTGCTGACGCCGGCGGAGACCGTACGCCGCTTCTCCACGACTCTAACATGGAATATTCTGGCGGAGAGGCGGAGAATACTATACCGCCACCCCCTATGATTTCAACTAATCCTAAACAGCGCCTCGTATCGCTCGACGTTTTCCGAGGACTCACTGTTGCGTTGATGATATTGGTTGATGATGCTGGAGGAGCATTTCCGTCAATAAACCATGCGCCGTGGTTCGGATTGACTCTCGCAGACTTTGTTATGCCGTTTTTTCTCTTTGGAGTTGGTGTTTCTGTCAGTCTCGTGTTCAAGAAAGTAGAAAACAGATTAGAAGCCACGAAGAGAGTAATGGAAAGATCTATCAAGCTCTTTCTCCTAGGAATGATCCTTCAAGGTGGTTATTTTCATGGCCGTGGTGATTTAACTTATGGAATTGATCTGGAGAAAATACGAATAATGGGTGTGCTACAGAGGATTGCTATTGGATATATGCTGGCTTCAGTTTTGGAGATATGGCTTGTAAACAATACTGCTGTTAATTCAGCAATAACATTCATGAAGAGATATTCGTATCAGATCGTGGCTGGAATTTTAGTAGGTGTAATATACATGGGTCTGCTGTATGGCCTTTATGTTCCAGACTGGAACTTTGATGCATCAAGTCTCAGGATGACCCAACGAGTGCTACTTGGTGCTAATTCTCAAACTGTCTACTGTGGGATAAGGGGTAGCCTTGAACCTCCTTGCACTGCAGTTGGTTTCATTGATCGAGTTATTATCGGTGAAAATCATCTGTATCAACGTCCTGTATATAGAAGAACCAAGGAATGCAGTGTCAATTCGCCAGATTATGGTCCTCCACCACCAAATTCCCCTGCATGGTGTCTTGCTCCATTTGATCCAGAGGGTATTTTAAGGTACTTCTGTTTTCTAGTGAGAATCTCTGCGTTTTCACTTCATGTAATGTACTTCATATTCCCGGAAATTCGCTACTATATACTCTTCATTTCTCCAGGTGTCCCACTTTCTAAACCACTGTACACATTGAGCTATATGTTTGTCACAGCAGGGGCATCTGGTGTTCTTTTGACCGTTATCTATTTTATCGTCGACACAAGATGCATTAGGAAGCCTACTCTATTATTCCAGTGGATGGGAATGAACGCTCTTGTAGTGTATGCTCTGGCTGCTTGCGAAATATTTCCAGCTGCTGTTCAAGGTGTCTACTGGCGTTCGCCTGAAAACAATCTGGTAGACTTGTCGGAAAAGCTACTGCAATGGGTCTTCCATTCAGATAAGTGGGGTACGTTGATCTTTGTTTTTGTGGAGATCTTATTTTGGGGTTTTGTTGCTGGTTTCCTCCATTTCAAACGTGTATATATAAAATTTTGA
Protein:  
MASITAVSADAGGDRTPLLHDSNMEYSGGEAENTIPPPPMISTNPKQRLVSLDVFRGLTVALMILVDDAGGAFPSINHAPWFGLTLADFVMPFFLFGVGVSVSLVFKKVENRLEATKRVMERSIKLFLLGMILQGGYFHGRGDLTYGIDLEKIRIMGVLQRIAIGYMLASVLEIWLVNNTAVNSAITFMKRYSYQIVAGILVGVIYMGLLYGLYVPDWNFDASSLRMTQRVLLGANSQTVYCGIRGSLEPPCTAVGFIDRVIIGENHLYQRPVYRRTKECSVNSPDYGPPPPNSPAWCLAPFDPEGILRYFCFLVRISAFSLHVMYFIFPEIRYYILFISPGVPLSKPLYTLSYMFVTAGASGVLLTVIYFIVDTRCIRKPTLLFQWMGMNALVVYALAACEIFPAAVQGVYWRSPENNLVDLSEKLLQWVFHSDKWGTLIFVFVEILFWGFVAGFLHFKRVYIKF
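
The Protein sequence above should be the translation of the CDS sequence. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1); the record does not state which table was used. `translate` is an illustrative helper, not a library function, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1, assumed), with codons enumerated
# in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a nucleotide CDS, stopping at the first stop codon.

    Codons that are not in the table (for example, those containing N)
    are reported as 'X'.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 13 codons and last 3 codons of the CDS in this record.
head = translate("ATGGCTTCAATCACGGCGGTATCTGCTGACGCCGGCGGA")
tail = translate("AAATTTTGA")
print(head, tail)
```

Passing the full CDS string to `translate` and comparing the result with the Protein string extends the same check to the whole record.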